Skip to content

Conversation

@danbraunai-apollo
Copy link
Contributor

Description

  • Allows for running APD on simple-stories models.
  • Only includes the loss functions necessary for our initial exploration, and not everything that is supported for tms and resid_mlp.
  • Adds a streamlit dashboard at spd/experiments/lm/app.py to explore the components for a trained model.

How Has This Been Tested?

None! Notably, no tests yet made for whether the tokens accurately match the text.

Does this PR introduce a breaking change?

No

danbraunai-apollo and others added 29 commits March 17, 2025 16:00
* Add layerwise recon

* Add layerwise_random_recon_loss

* Protect the eyes of mathematicians
* WIP: Add dashboard

* Create base_cache_dir if it doesn't exist

* Functional dashboard

* Add simple-stories-train and datasets to pyproject.toml
@danbraunai-apollo danbraunai-apollo merged commit f47c5b4 into dev Apr 22, 2025
1 check passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants